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ABSTRACT 


The accurate and timely identification of diseases and pests impacting rice cultivation is crucial for farmers, 
allowing for swift intervention and thereby minimizing economic losses. Recent progress in convolutional 
neural networks (CNNs) has significantly boosted the accuracy of image classification, yet their resource- 
intensive nature, demanding significant memory and processing power, underscores the necessity of 
leveraging pre trained models. Additionally, the paper introduces an ensemble approach combining deep 
learning with traditional machine learning methods, further enhancing the effectiveness of disease and pest 
detection in agricultural settings. The paper presents a novel approach leveraging state-of-the-art large- 
scale architecture, ResNet-50, to propose two distinct models: Model-1 integrates ResNet-50 with SVM, 
while Model-2 incorporates ResNet-50 with PCA and SVM to effectively detect and identify rice diseases 
and pests. Through experimentation on authentic and real datasets, the paper demonstrates the effectiveness 
of these models. Additionally, recognizing the constraints of large-scale architectures, especially 
concerning their compatibility with mobile or embedded devices due to processing power and memory 
limitations, the paper introduces and evaluates two proposed models. Notably, Model-2 surpasses Model-1, 
achieving a superior accuracy of 93.7% compared to 91.6% in Model-1. Moreover, Model-2 significantly 
reduces the feature set size by 100% compared to Model-1 through dimensionality reduction using PCA. 


Keywords: Deep Learning, Ensemble Methods, Rice Plant Disease Detection, Pretrained Models, 
Transfer Learning 
challenges from diseases that impact crop quality 


1. INTRODUCTION 


Automated plant disease detection has 
become a vital component of precision agriculture. 
Diseases triggered by fungi, bacteria, and insects 
greatly affect crop yield and productivity. Classifying 
plant leaf diseases presents a significant challenge 
due to striking similarities between classes and 
complex pattern variations. Moreover, changes in 
climate conditions can substantially amplify the 
growth of plant infections. Early detection of diseases 
affecting plant leaves stands as a paramount 
challenge for ensuring optimal productivity in the 
agriculture sector. With diseases accounting for over 
50% of the reduction in plant productivity, timely 
identification facilitates prompt intervention, thereby 
mitigating crop loss. 


In agriculture, rice plants face considerable 


and quantity. This highlights the pressing 
requirement for automated plant disease 
identification and detection to improve yield. Rice 
holds immense importance as a key cereal crop in 
our country, with rice farming playing a pivotal 
role in the agricultural economy. Throughout the 
growth stages of rice, diseases affecting the plants 
represent a primary concern for farmers, leading 
to substantial losses alongside issues such as pests 
and environmental factors. Despite the 
availability of various methods for disease 
detection in crops, such as image processing and 
remote sensing, these systems often exhibit poor 
accuracy. 


The remarkable growth of the Indian 
population necessitates significant enhancements 
in cultivation and agricultural practices. Rice, 
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being a staple crop, holds unparalleled importance as 
the most coveted food crop throughout the nation [1]. 
Nevertheless, rice stands out as a crop highly 
susceptible to plant diseases, which consequently 
impacts cultivation and overall profitability. To 
mitigate such challenges and bolster crop yields, it's 
imperative to anticipate and prevent plant diseases at 
their early stages[2]. Forecasting diseases through 
visual observation alone is often slow, occasionally 
imprecise, and can result in heightened 
expenses.Estimating disease types is challenging and 
susceptible to errors [3]. The challenges stem from a 
lack of comprehensive understanding of the plant. 
Consequently, if diseases in rice plants are not 
anticipated or identified in their early stages, it 
adversely impacts rice production, a trend observed 
over recent decades[4]. The image processing 
technique for disease detection typically involves 
several steps, including image acquisition, 
preprocessing, segmentation, feature extraction, and 
classification [5]. Handcrafted feature extraction for 
rice plant disease detection comes with inherent 
limitations[6]. Firstly, crafting effective features 
manually demands specialized domain knowledge 
and consumes significant time, posing challenges in 
capturing all pertinent information comprehensively. 
Moreover, handcrafted features may lack robustness 
in generalizing across various types of rice plant 
diseases and diverse environmental conditions. 
Furthermore, this approach exhibits limited 
adaptability, necessitating manual intervention for 
feature modifications to address evolving disease 
patterns or incorporate new data. The limitations 
associated with the utilization of Convolutional 
Neural Networks (CNNs) for detecting diseases in 
rice plants include the requirement for extensive 
labeled data, susceptibility to overfitting, and 
computational complexity, presenting challenges in 
resource-limited settings[6]. 


The current research paper proposes an 
innovative methodology that integrates a 
Convolutional Neural Network (CNN) utilizing a 
ResNet-50 architecture with conventional machine 
learning methods like Principal Component Analysis 
(PCA) and Support Vector Machine (SVM). The 
study utilizes a large dataset comprising images of 
healthy and unhealthy rice plants collected from real- 
world environments. By employing a pretrained 
ResNet model for feature extraction, training time 
overhead is minimized. Two models are proposed: 
Model-1 involves feeding ResNet-50 output features 
directly into an SVM classifier, while Model-2 
employs PCA to reduce feature dimensionality before 
inputting them into an SVM _ for classification. 


Benchmark datasets are utilized to assess the 
performance of the proposed models. It turned out 
that Model-2 outperforms Model-1. This 
approach is assessed using performance metrics 
including accuracy, precision, recall, and Fl 
score. 


2. RELATED WORK 


In agricultural image recognition and 
classification, two primary _ technological 
categories dominate: deep learning and traditional 
machine learning[7]. Notably, deep learning has 
shown rapid advancements and notable 
achievements in this domain. For example, 
Mohanty et al. have successfully developed a 
deep learning model with the ability to identify 14 
different crop species and detect 26 various crop 
diseases.[8]. Previous research efforts in the realm 
of rice plant infection recognition and 
classification have been limited. Lu et al. [9] 
introduced a novel method employing Deep 
Convolutional Neural Network (DCNN) for 
predicting rice plant diseases. Their research 
utilized a dataset comprising numerous images 
depicting both healthy and diseased paddy stems 
and leaves, resulting in enhanced accuracy when 
compared to conventional machine learning 
methods. Furthermore, a segmentation model 
was developed by Dhingra et al.[10] based on 
neutrosophic logic, originating from fuzzy set 
theory theory to estimate Regions of Interest 
(ROI). This model utilized three Membership 
Functions (MFs) for segmentation, employing 
feature subsets for predicting the presence of 
plant leaf infections based on segregated sites. 
Islam et al. [11] introduced an innovative method 
for predicting and classifying rice plant diseases. 
Their approach employs image processing (IP) 
techniques to detect diseases by analyzing the 
proportion of RGB values in the affected area. In 
the context of paddy leaf disease prediction, 
automated disease detection is facilitated through 
the utilization of image processing (IP) 
techniques [12]. In this investigation, a hybridized 
methodology integrating grayscale co-occurrence 
matrix, Discrete Wavelet Transform (DWT), and 
Scale Invariant Feature Transform (SIFT) was 
adopted for feature extraction. These extracted 
features were subsequently employed in 
conjunction with various machine learning 
classifiers to distinguish between healthy and 
diseased crops. 
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Kaya et al. [13] analyzed the outcomes of 
implementing four distinct Transfer Learning (TL) 
strategies for plant classification utilizing Deep 
Neural Networks (DNNs) on four standard datasets. 
Their findings underscored the substantial advantages 
of TL in automated plant prediction, thereby 
enhancing the performance of plant disease 
classifiers. In a prior study [14], A Convolutional 
Neural Network (CNN) model was utilized to 
forecast weeds in soybean crop _ images, 
distinguishing between grass and broadleaf weeds. 
The image dataset encompassed various soil, 
soybean, broadleaf, and grass weed images. CNN, 
employed for Deep Learning (DL), yielded optimal 
results in image recognition tasks. 


3. THE PROPOSED RICE PLANT DISEASE 
DETECTION MODEL 


3.1 Framework Overview 


Our research endeavors in rice plant disease 
identification employ a dual approach, integrating 
transfer learning and traditional machine learning 
algorithms, delineated in Fig. 1 and Fig. 2, where Fig. 
1 corresponds to Model-1 and Fig. 2 to Model-2. 
Initially, rice plant images are categorized based on 
predefined classes as detailed in Table 1, then 
processed through a pretrained ResNet-50 model via 
transfer learning. This model, initially trained on the 
ImageNet dataset featuring 1000 classes, adeptly 
extracts multi-level features. Leveraging transfer 
learning enables us to conserve computational 
resources by retaining the pretrained network 
parameters. 


In Model-1, the pretrained ResNet-50 model 
automatically extracts features, yielding embeddings 
utilized for classification via an SVM machine 
learning classifier. Conversely, in Model-2, the model 


E-ISSN: 1817-3195 
again extracts features, followed by 
dimensionality reduction through Principal 


Component Analysis (PCA), ensuring retention of 
99% variance. PCA serves a dual role: reducing 
computational overhead and _ mitigating 
overfitting risks inherent in high-dimensional 
embeddings. The streamlined embeddings are 
then fed into an SVM classifier for classification. 
Further elucidation on these methodologies is 
provided in subsequent sections. 


3.2 Overview of Model-1 


Figure 1 showcases an elaborate block 
diagram outlining our proposed model for rice 
plant disease detection. The illustrated flowchart 
delineates the sequence of steps, commencing 
with the input of rice plant images into the model. 
This model incorporates a pretrained ResNet-50 
model, proficient at feature extraction from the 
input images. Subsequently, these extracted 
features are fed into an SVM classifier to 
precisely categorize the images into their 
respective disease classes.A notable advantage of 
our approach is the integration of a pretrained 
ResNet-50 model [15], significantly reducing 
training time while leveraging the model's 
expertise in feature extraction from images. 
Additionally, we capitalize on the efficiency of 
using a Support Vector Machine (SVM) [16] 
classifier instead of Artificial Neural Networks 
(ANN) [17], enables our model to operate 
seemlessly even on systems possessing limited 
restricted resources, such as mobile devices or 
embedded systems. This strategic choice ensures 
that our model remains accessible and applicable 
across a wide range of platforms, including those 
with low-end resources. 


Inputimage Bee ResNet-50 (jae SVM 
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Figure 1: Model-1 Integrated Framework For Disease Detection In Rice Plants. 
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3.3 Overview of Model-2 


Figure 2 depicts a detailed block diagram of 
our innovative framework designed for the detection 
of diseases in rice plants. The flowchart visually 
delineates the sequential process, commencing with 
the input of rice plant images into the model. An 
integral component of this model is the inclusion of a 
pretrained ResNet-50 model, renowned for its 


aus fe - ea 


adeptness in feature extraction from images. 
Following feature extraction, the features undergo 
dimensionality reduction through PCA, 
effectively reducing the number of features. 
Finally, the reduced-dimensional features are 
obtained inputted into an SVM classifier, ensuring 
precise classification of the images into their 
respective disease classes. 
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Figure 2 : Model-2 Integrated Framework For Disease Detection In Rice Plants. 


An evident strength of our methodology lies in the 
incorporation of a pretrained ResNet-50 model, 
markedly diminishing training time while capitalizing 
on its proficiency in extracting features from images. 
Moreover, by implementing PCA [21] to condense 
the feature set, we optimize computational efficiency 
without sacrificing accuracy. Embracing the efficacy 
of a Support Vector Machine (SVM) [22] classifier 
over Artificial Neural Networks (ANN) [23] further 
enhances our model's adaptability, facilitating 
seamless operation even on resource-constrained 
platforms like mobile or embedded systems. This 
strategic decision ensures the accessibility and 
applicability of our model across diverse platforms, 
including those with limited resources. 


3.4 ResNet-50 


ResNet-50 [24] is a convolutional neural 
network architecture renowned for its depth and 
efficacy in image-related tasks. With 50 layers, it 
excels in feature extraction from input images 
typically sized at 224x224 pixels in RGB format. Its 
output dimensions vary depending on the task, often 
culminating in 1000 classes for ImageNet 
classification. Notably, ResNet-50's innovation lies in 
its skip connections, which directly link layers, 
addressing the issue of vanishing gradients and 
enhancing smoother gradient flow during the training 


process. Further enhancing its efficiency, ResNet- 
50 incorporates bottleneck blocks, where 1x1 
convolutions reduce computational complexity 
without compromising representation power. 
Lastly, it adopts global average pooling as its final 
layer, summarizing feature maps into a compact 
representation, instead of the traditional fully 
connected layers, contributing to its widespread 


adoption and success in computer vision 
applications. 
3.5 Pea 

PCA [25], an extensively utilized 


unsupervised machine learning technique, serves 


multiple purposes such as_ dimensionality 
reduction, visualization, noise filtering, and 
feature extraction[26]. Consider a dataset 


comprising grayscale images of rice plants [27], 
each sized at 200x200 pixels, resulting in a total 
of 40,000 pixels per image. Upon applying PCA, 
we observe that the first 25 components retain 
80% of the variance, while the first 75 
components capture 90% of the variance. 
Consequently, we can compress the original 
40,000 pixels into a 75-dimensional vector, 
preserving at least 90% of the image information. 
This condensed 75-dimensional vector can then 
be referred to as the pixel features of the rice 
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plant images. We have used PCA in our proposed 
Model-2 , where it has reduced the number of 100352 
features generated from ResNet-50 to 941 features 
reduced to 100% by retaining the main information of 
the images. To mitigate overfitting during the training 
process, we'll apply Principal Component Analysis 
(PCA) to the features extracted from ResNet-50, 
aiming to retain 99% of the variance. This step is 
crucial for its potent dimensional reduction effect, 
ensuring the preservation of essential information 
[28]. 


3.6 Svm 


Support Vector Machines (SVMs) [29] are 
utilized for classifying a rice image dataset consisting 
of 9 classes, encompassing 5 diseases, 3 pests, and 1 
healthy class. SVMs operate by determining the 
hyperplane that optimally segregates these classes in 
the feature space, aiming to maximize the margin 
between classes. Employing kernel functions, SVMs 
accommodate linear and non-linear classification, 
facilitating the classification of rice images afflicted 
by various diseases and pests. Their robustness 
against overfitting makes SVMs well-suited for this 
task, contributing to their efficacy in identifying and 
distinguishing between different rice health 
conditions [30]. 


4. RESULTS AND DISCUSSION 


In this section, a sequence of tests was 
carried out on a laptop T490, featuring a Core i7 CPU 
and 16 GB of memory. The experimental 
configuration employed Jupyter Notebook operating 


with Python 3.7. Initially, tests were executed on 
the dataset utilizing Model-1, succeeded by 
experiments with Model-2. Following this, an 
extensive comparison and assessment were 
undertaken, scrutinizing performance across 
diverse evaluation metrics including accuracy, 
precision, recall, Fl score, and feature count [31] 


4.1 Dataset 


Rice plants are vulnerable to a multitude 
of diseases and pests, which can impact various 
parts of the plant. This study encompasses nine 
classes, including five diseases, three pests, and 
one healthy plant class. The classification is 
detailed in Table 1, with certain diseases grouped 
together due to similar treatment methods and 
occurrence patterns. Symptoms manifest in 
different parts of the rice plant, with diseases like 
Bacterial Leaf Blight and pests like Brown Plant 
Hopper primarily affecting the leaves. Other 
diseases such as Sheath Blight and pests like 
Stemborer target the stem, while Neck Blast and 
False Smut affect the grains. To prevent confusion 
between diseased and dead plant parts, images of 
both have been included in the dataset. Note that 
Sheath Blight, Sheath Rot and their simultaneous 
occurrence have been considered in the same 
class, because their treatment method and place of 
occurrence are the same. Additionally, some 
classes exhibit multiple symptom variations, also 
detailed in Table 1, to encompass all observed 
variations found in BRRI's paddy fields. Sample 
images for each class are illustrated in Figure 3. 


Table 1: Collection Of Images Representing Various Classes. 
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The dataset utilized in this research, 
comprising a collection of 1426 images depicting 
various rice diseases and pests has been gathered. in 
real-life scenarios Bangladesh Rice Research 
Institute's [BRRI] paddy fields, is made publicly 
available by Chowdhury R. Rahman et al. [15]. The 
dataset encompasses nine classes, including eight 
classes representing various rice diseases and pests, 


4169 


along with one class denoting healthy rice plants. 
Access to the dataset can be found through the 
provided link 
(https://drive.google.com/open?id=1ewBesJcguri 
VTX8sRJseCDbXAF T4akK) [15]. Additional 
information about the dataset is available in the 
cited publication [15], where the author utilized it 
for implementing a simple CNN. 
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(b) Brown Plant Hopper Pest 
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(c) Bacterial Leaf Blight disease 


(d) Neck Blast disease 


(f) Hispa pest 


(g) Sheath Blight & Sheath Rot 
disease 


(h) Brown Spot disease 


—- 


(i) Healthy crop 


Figure 3 : A Sample Image Of Each Detected Class. 


4.2 Experimental findings 


In this study, we conducted experiments to 
evaluate the performance of two models, referred to 
as Model-1 and Model-2, for a specific task. The 
experiments were carried out using the Rice dataset, 
wherein two models were proposed. In Model-1, an 
ensemble model comprising a pretrained ResNet-50 
and SVM was employed, while Model-2 utilized an 
ensemble model consisting of ResNet-50, PCA, and 
SVM. Specifically, the training and testing sets were 
divided in a 8:2 ratio. For PCA, components were 
selected to retain 99% variance. 


Model-1 utilizes a pretrained ResNet-50 
model for feature extraction. The ResNet-50 model 
generated a total of 100,352 features from the input 
data. These features were then fed into a Support 
Vector Machine (SVM) classifier for classification. 
The accuracy achieved by Model-1 was measured to 
be 91.6%, with an F1 score of 81.7. The recall and 
precision rates were found to be 82.4% and 81.8%, 
respectively. 


Model-2 also utilizes a pretrained ResNet-50 
model for feature extraction.Furthermore, Principal 


Component Analysis (PCA) was employed to 
decrease the dimensionality of the extracted 
features, leading to a notable reduction in the 
feature count down to 941, representing almost 
100% reduction compared to Model-1. The 
reduced-dimensional features were then fed into 
an SVM classifier for classification. The final 
outcomes of Model-2 demonstrated improved 
performance, with an accuracy of 93.7% and an 
Fl score of 86.4. The recall and precision rates 
for Model-2 were measured at 87.8% and 89.6%, 
respectively. 


4.3 Comparative analysis of Proposed Models 


To evaluate the performance and 
generalizability of our proposed models, 
experiments were carried out using a dataset 
consisting of 1426 images of paddy. This dataset 
consists of 9 classes, including 8 diseased classes 
and | healthy class. All images are standardized 
to a size of 256x256 pixels with 3 channels. The 
dataset is partitioned into training and testing sets 
at a ratio of 8:2. Evaluation metrics encompass 
precision, recall, Fl-score, and the number of 
parameters. 
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Accuracy = (Tn+Tp) / (Ta+Fp+Tp+Fn) 

Precision = Ty / (Tp+Fp) 

Recall = Tp / (Tpt+Fn) 

Fl Score= 2 * ( Precision * Recall ) / ( Precision + 
Recall ) 


In binary classification, T) : True positive, 
Tn : True negative, F, : False positive, and F, : False 
negative are technical terms used to evaluate 
classifier performance. Specifically, T, refers to 
positive samples correctly classified, Tn denotes 


negative samples correctly classified, F, indicates 
positive samples misclassified, and F, represents 
negative samples misclassified [32]. 


The experimental comparison results are 
presented in Table 2. These experiments were 
conducted using a designated experimental 
environment, ensuring consistency and 
reproducibility in the results obtained.The results 
offer valuable insights into the efficacy of the 
proposed models for the given task. Here, we 
provide a comparative analysis of their 
performance. 


Table 2 : The Metrics From The Comparison Experiments. 


Model-2 exhibits superior performance over 
Model-1 across all metrics tested on the rice image 
dataset. Despite having a_ significantly smaller 
number of features (941 compared to Model-1's 
100,352), Model-2 demonstrates higher accuracy 
(0.937 vs. 0.916), precision (0.896 vs. 0.818), recall 
(0.878 vs. 0.824), and Fl-score (0.864 vs. 0.817). 
This suggests that Model-2 is more adept at 
accurately identifying rice instances within the image 
dataset, outperforming Model-1 even with a 
substantially reduced feature set. 


Additionally, Model-2's higher precision 
indicates its greater ability to correctly identify true 
positives, while its higher recall suggests it captures a 
larger proportion of actual rice instances, ultimately 
leading to a better balance between minimizing false 
positives and false negatives. Model-2 emerges as the 
preferred choice for the rice image dataset due to its 
consistently higher performance across all evaluated 
metrics. Its effectiveness in achieving high accuracy, 
precision, recall, and Fl-score despite a significantly 
smaller number of features underscores its efficiency 
in rice image classification tasks. This highlights the 
importance of model architecture and feature 
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selection in optimizing performance, as Model-2 
showcases that superior results can be achieved 
with a more streamlined approach. 
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Figure 4: : Comparison Of Model-1 and Model-2 Evaluation Metrics 


The below is the confusion matrix [Table 
3] generated on test sample’s using Model-1. The 
confusion matrix illustrates how well the model 
performs in classifying various rice diseases and 
health states. Strong identification of healthy rice 


evident, with notable misclassifications in 
distinguishing between Blast and Blast affected 
high (Bpns), and in classifying False Smut (F,;) and 
Shudra Rot (Sp:). These findings highlight both the 
model's strengths and areas for refinement in rice 


(H) and diseases like Hispa (His) and Blast (Bip) is disease classification. 
Table 3 : Confusion Matrix Generated Using Model-I. 
Bp | 23 0 0 0 0 0 0 0 0 0 
Bpte 0 9 0 0 0 1 1 0 0 0 
Bons 0 3 0 0 0 0 0 0 D) 0 
Teneahele eee 0 0 0 b| 0 0 0 0 0 0 
Fy 1 0 0 @ | B | i 0 1 0 0 
H 1 0 0 0 i 0 0 D 0 
His 2 1 0 0 0 2 16 0 3 0 
Np 0 0 0 0 0 0 0 56 0 0 
Sor 0 0 0 0 0 1 0 0 | 48 0 
Stm 0 0 0 0 0 0 0 1 1 42 
Bp | Bore | Bons | Bs | Fs | H | His | No | Spr | Stm 
Predicted Label 


The below is the confusion matrix [Table 4] 
generated on test sample’s using Model-2. The 
updated confusion matrix demonstrates improved 
performance over the previous one, with fewer 
misclassifications and enhanced accuracy in disease 


classification. Notably, distinctions between Blast 
and Blast affected high (Bpns) and other diseases 
like Hispa (His) are clearer, resulting in more 
accurate predictions. These enhancements 
highlight the model's refined capability in 
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rice 
and 


diseases, 
showcasing 


accurately identifying various 
addressing previous limitations 
improved classification accuracy. 


This confusion matrix reveals insights into 
the performance of the classification model across 
multiple classes representing various plant diseases 
and conditions. Examining the diagonal elements, 
which indicate correct predictions, it's apparent that 
the model performs exceptionally well in identifying 
Healthy (H) plants, with 36 out of 40 instances 
correctly classified. Additionally, the model 
demonstrates strong performance in detecting Neck 
Blast (Nb) and Brown Spot (Bs), with all instances 
accurately classified. 


However, some misclassifications are 
evident, particularly in distinguishing between 
Brown Plant Hopper early symptoms (Bphe) and 
False Smut (Fs), where three instances of Bphe 
were incorrectly classified as Fs. Similarly, there 
are instances where Hispa (His) was misclassified 
as Brown Plant Hopper severe symptoms (Bphs), 
suggesting a potential overlap or similarity 
between these conditions. Despite these 
misclassifications, the overall performance of the 
model appears commendable, with the majority of 
classes accurately predicted. 


Table 4 : Confusion Matrix Generated Using Model-2. 


Bp :Bacterial leaf blight, Bphe :Brown plant hopper early symptoms,Bpns : Brown plant hopper severe 
symptoms, B; :Brown spot, F, :Flase smut, H: Healthy, His : Hispa, Np : Neck Blast, Sp, : Sheath blight 


and/or sheath rot, Sim : Stemborer 


In model-1, although it demonstrated a 
high accuracy and precision across several classes 
such as "Healthy" and "Neck Blast", it struggled 
with the early prediction of "Brown plant hopper 
severe symptoms" (Bphs) and "Hispa" (His) 
classifications, where it misclassified a notable 
portion of instances. For instance, in the case of 
Bphs, it incorrectly classified 3 instances as "Hispa" 
and 2 instances as "Brown spot". Similarly, in the 
case of His, it misclassified 2 instances as 
"Healthy" and 3 instances as "Neck Blast". These 
misclassifications suggest a weakness in model-1's 
ability to discern subtle symptoms and distinguish 


between closely related classes. The pie charts 
illustrating these performances are provided below 
the table, offering a visual representation of the 
distribution of correct and incorrect predictions for 
each class, shedding further light on the model's 
classification capabilities. 


On the other hand, model-2 displayed 
improved performance in predicting early 
symptoms of "Brown plant hopper severe 
symptoms" (Bphs) and "Hispa" (His) compared to 
model-1. It correctly classified a higher number of 
instances for these classes, indicating its enhanced 
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sensitivity to subtle symptoms and finer 
distinctions. For example, in the case of Bphs, 
model-2 correctly classified 2 instances, which 
were misclassified by model-1. Similarly, for the 
His class, model-2 correctly classified all instances, 
while model-1 misclassified several instances. This 
suggests that model-2 may be more adept at 


capturing nuanced patterns and making accurate 
predictions for these particular classes. Similarly, 
the pie charts depicting the class-wise performances 
of model-2 are located below the table, providing 
visual insights into its classification accuracy. 


S.No | Disease/Class Name Model-1 Performance Model-2 Performance 


1 Baterial leaf blight 


‘Bacterial ea bight Cassifeation Bacterial et bight -lasscation 


2 Brown plant hopper 
early symptoms 


Brown plant hopper (early symptoms) - Classification 


ww 


Brown plant hopper (severe symptoms) - Classification 


3 Brown plant hopper 
severe symptoms 


Brown spot - Classification Brown spot - Classification 


4 Brown spot @ 


Incomect predictions 


5 Flase smut 
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Figure 5 : Class Wise Distribution Of Predictions Of Model-1 (Left) And Model-2 (Right) 


The comparison of the two models reveals 
that both exhibit high accuracy, with Model-2 
outperforming Model-1. Model-2 demonstrates a 
lower rate of incorrect predictions (6.3%) compared 
to Model-1 (8.4%), resulting in a higher percentage 
of correct predictions (93.7% vs. 91.6%). This 
indicates that Model-2 offers enhanced precision in 


classifying rice diseases, potentially leading to 
more reliable diagnoses. The pie chart 
visualizations underscore these findings, depicting 
a marginally improved distribution of predictions in 
Model-2, further highlighting its superior 
performance in disease classification tasks. 
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Figure 6 : Overall Distribution Of Predictions Of Model-I (Left) And Model-2 (Right) 


5. CONCLUSION 


In conclusion, our study underscores the 
critical importance of accurate disease and pest 
identification in rice cultivation for effective 
agricultural management. Leveraging pretrained 
models, such as ResNet-50, in conjunction with 
advanced machine learning techniques like support 
vector machine (SVM) and principal component 
analysis (PCA), as demonstrated by Model-1 and 
Model-2, our research unveils promising avenues for 
enhancing classification accuracy in agricultural 
settings. 


The utilization of state-of-the-art 
architecture like ResNet-50, already trained on large 
datasets, combined with machine learning 
methodologies, showcases the potential for enhanced 
disease and pest detection. Particularly noteworthy is 
the superior performance of Model-2, achieving an 
impressive accuracy of 93.7% while significantly 
reducing feature set size through PCA. These 
findings not only contribute to advancing agricultural 
technology but also address practical challenges such 
as model scalability and compatibility with resource- 
constrained environments. 


By providing insights into optimizing model 
efficiency and performance, this research contributes 
to the development of accessible and effective tools 
for disease and pest management in rice cultivation, 
thereby promoting agricultural sustainability and 
economic resilience. 


Looking ahead, there is substantial potential 
in further exploring pretrained models and integrating 
location, weather, and soil data with images of 
diseased plant parts to devise a comprehensive and 
automated plant disease detection system. Further 


exploration into segmentation or object detection 
algorithms could amplify the efficacy of rice 
disease and pest classification, especially in 
environments with diverse backgrounds. 
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